An English-To-Korean Machine Translator: MATES/EK

نویسندگان

  • Key-Sun Choi
  • Seungmi Lee
  • Hiongun Kim
  • Deok-Bong Kim
  • Cheol Jung Kweon
  • Gil-Chang Kim
چکیده

This note introduces at, English-to-Korean Machiue 'lYanslation System MA!t'ES/EK, which ha.s been developed as a research prototype and is still under upgrading in KAIST (Korea Advanced Institute of Science and Technology). MATES/EK is a transfl:r.-ba.~ed system and it has several subsystems tt, at can be used to support other MT-developments. They are gram-. mar developing environment systems, dictionary developing tools, a set of angmented context free gritm Inars for English syntactic analysis, and so on. 1. I n t r o d u c t i o n An ],;nglish-to-Korean mactdne translat ion system M A T E S / E K has been developed through a co-research done by KAIST and SF, RI (Systmns Engineering l{.esearch Ins t i tu te) from 1988 to 1992, and is still under ewflution in KA1ST. It has several tools snpport ing system dew~.lopment, such as the g rammar writing language and its de veloping environment, a set of augmented context free g rammar for English syntact ic analysis, ;rod dict ionary editor. This system has been developed for UNIX workstat ion. M A T E S / E K was originally developed using Common Lisp in order to test the possibility of English-to Korean machine t ranslat ion, and then it has been tota l ly reconstructed using C language. Its main target dmmfin is elect r ic /electronic papers and so the dict ionary ~tnd the grammars are specitically adjusted to the do*This research is partly supported by Center for ArtiIicial Intelligence Research (CAIR) (1992). main and one of sample sentences is II"li;F, computer m;tgazine September 1991 issue to test and ewduate the system. 2. O v e r v i e w of T h e S y s t e m M N I ' E S / E K is a typical transfi 'x-based system, which does English sentence analysis, tnmsforms the result (parse tree) into an intermediate repre-. sentation, and then transRn'ms it into ~L Korean syntactic s tructure to construct ~t Korean sentmlce. Figure 1 de.picts the ow'.rall configuration of M N I ' E S / E K , which has R)llowing features: * Morphological Analysis Using l'¢-gr;ml : We resolve tile category ambiguities by combining the N-gram and the rules. (Kim, 1992) . Augmented Context Free ( ' ,ranlmars for Ellglish Syntactic An~dysis : We dew'.loped a set of augmented context free gr~mmu~r rules ['or general English synt~mtic analysis a, td the. ~malyzer is implemented using '/bnfita~ LI{. parsing algori thm (Tomita, 1987). , I,exical Semantic Structure (LSS) to repre-sent the intermediate representat ion : The result of the syntact ic s t ructure is trans.tbrmed into an intermediate representa tkm LSS, which is a dependency s t ructure tha t is relatively independent to specific languages. In I, SS, the consti tuents in a sentence are combined only in head-dependent relation based on the lexieal categories, and there art.' no order relatkm between the const i tuents. Hence LSS is desirable for trans-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applying Statistical Post-Editing to English-to-Korean Rule-based Machine Translation System

Conventional rule-based machine translation system suffers from its weakness of fluency in the view of target language generation. In particular, when translating English spoken language to Korean, the fluency of translation result is as important as adequacy in the aspect of readability and understanding. This problem is more severe in language pairs such as English-Korean. It’s because Englis...

متن کامل

Evaluation Results of Concept-based Translator with Partial Parsing

In this paper, we describe an evaluation of the output of the translator using conceptbased grammars. This translator translates the Korean sentence generated by a speech recognizer into an English sentence through a concept analysis approach. A partial parsing function added to the translator and obtained better improvement because the performance of the parser (whole parser) is low in the sta...

متن کامل

Hybrid Approaches to Improvement of Translation Quality in Web-based English-Korean Machine Translation

The previous English-Korean MT system that was the transfer-based MT system and applied to only written text enumerated a following brief list of the problems that had not seemed to be easy to solve in the near future : 1) processing of non-continuous idiomatic expressions 2) reduction of too many ambiguities in English syntactic analysis 3) robust processing for failed or illformed sentences 4...

متن کامل

Machine and Web Translator for English to Bangla Using Natural Language Process

The modern web contents are mostly written in English and developing a system with the facility of translating web pages from English to Bangla that can aid the massive number of people of Bangladesh. It is very important to introduce Natural Language Processing (NLP) and is required to developing a solution of web translator. It is a technique that deals with understanding natural languages an...

متن کامل

English-Korean Patent Translation System: FromTo-EK/PAT

This paper addresses a method for customizing an English-Korean machine translation system from general domain to patent domain. The customizing method includes the followings: (1) extracting and constructing large bilingual terminology and the patent-specific translation patterns, (2) adapting the probabilities of POS tagger trained from general domain to the patent domain, (3) syntactically a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994